# MSP-Podcast Dataset
SER Odyssey Baseline WavLM Arousal
MIT
A speech emotion recognition baseline model based on the WavLM architecture, specifically designed to predict arousal values in speech (0-1 range)
Audio Classification
Transformers English

S
3loi
72
2
SER Odyssey Baseline WavLM Dominance
MIT
A speech emotion recognition model based on the WavLM architecture, specifically designed to predict dominance features in speech
Audio Classification
Transformers English

S
3loi
15
1
SER Odyssey Baseline WavLM Multi Attributes
MIT
A multi-attribute speech emotion recognition baseline model based on WavLM architecture, predicting arousal, dominance, and valence dimensions
Audio Classification
Transformers English

S
3loi
23.09k
7
Featured Recommended AI Models